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METHOD AND SYSTEM FOR IMPLEMENTING OSPF REDUNDANCY 

Background of the Invention 

L Field of the Invention 

This invention relates to network communications and more particularly to 
redundancy of routing protocols, such as the Open Shortest Path First ("OSPF") protocol 
and apparatus for protecting protocol services of a router and neighbor routers from 
failure. 

2. Related Art 

The Internet Protocol ("IP") is the foundation for many public, such as the 
Internet, and private, such as a corporate Intranet, data networks. Convergence of voice, 
data and multimedia networks has also been largely based on IP-based protocols. 

Data packets progress through the data networks by being sent from one machine 
to another towards their destination. Routers or other types of switches are used to route 
the data packets over one or more links between a data source, such as a customer's 
computer connected to the data network, and a destination. Routing protocols such as 
Border Gateway Protocols ("BGP"), Routing Information Protocol ("RIP"), and Open 
Shortest Path First Protocol ("OSPF") enable each machine to understand which other 
machine is the "next hop" that a packet should take towards its destination. Routers use 
the routing protocols to construct routing tables. Thereafter, when a router receives a 
data packet and has to make a forwarding decision, the router "looks up" in the routing 
table the next hop machine. Conventionally, the routers look up the routing table using 
the destination IP address in the data packet as an index. 

In the basic OSPF algorithm, a router broadcasts a hello packet including the 
router's own ID, neighbors' IDs the router knows and also receives such messages from 
other routers. If a router receives a Hello packet, which includes its own ID, from 
another router that the router has been aware of, on the understanding that the two routers 
have become aware of each other, the two routers exchange network link-state 
information by sending routing protocol packets. The router creates a routing table based 
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on the network link-state information collected by running the link-state routing 
algorithm, typically the Dijkstra algorithm. In OSPF, the routing table can specify the 
least-cost path, based on a cost determined by considering many factors including 
network link bandwidth , as the packet route. When a network link changes, each router 
5 calculates the shortest path for itself to each of the networks and sets its own routing table 
accordingly to the paths. A route calculation unit is used for creating a routing table. 

Each router, while it transmits or receives control packets and network link-state 
information, manages the states of other routers on the network to which this router is 
connected and also manages the states of the interfaces through which this router is 
10 connected to networks. With regard to the states of routers, each router manages the 

routers' ID's, and checks if each of those routers is aware of this router, or checks if each 
of those routers has completed the transmission and reception of network link-state 
S information. With regard to interface state, each router manages the addresses of the 

Hi interfaces and other routers connected to a network to which an interface is connected, 

g 15 When conventional IP edge routers lose their primary circuitry and operation falls 

SI back to a redundant controller, a five to fifteen minute outage ensues while the router 

P releases the routing states and packet forwarding tables. In order to enhance the 

; j reliability of the router device, it is important to multiplex the above-mentioned route 

calculation units. The multiplex router device includes a plurality of route calculation 
II 20 units, and always has one route calculation unit placed in the active mode to make it 
1 execute an ordinary process while keeping the remaining route calculation units in a 

standby mode. When the route calculation unit in the active mode runs into trouble, the 
multiplex router device brings one of the waiting route calculation units into the active 
mode (this is referred to as a system switchover of route calculation units), and the one 
25 other route calculation unit takes over and continues to execute the process that was 
previously being executed by the route calculation unit in trouble. 

U.S. Patent No. 6,049,524 describes a multiplex router device which reduces the 
amount of information to be transmitted from a route calculation unit in operation to a 
route calculation unit in standby mode. The route calculation unit in the active mode is 
30 connected by an internal bus to the route calculation unit in the standby mode. The route 
calculation unit in the active mode stores network link state information showing 
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connections of the router and other routers with networks, neighboring router states 
showing states of neighboring routers and interface states showing states of network 
interfaces to connect the multiplex router device to the network. The route calculation 
unit in the active mode sends to the route calculation unit in the standby mode only the 

5 network link state information. In the route calculation unit in the standby mode, a 

database integration module that received the link-state information stores its contents in 
a link-state database. When a failure occurs in the route calculation unit in the active 
mode, the route calculation unit performs the routing protocol process by using the stored 
link-state database, so it is not necessary to exchange information with other routers to 

10 collect the network link state information over again. For awhile after the switchover to 
active mode the route calculation unit has no information about the neighbor route state 
and interface state. Hello packets are transmitted from the route calculation unit brought 
into the active state. The route calculation brought into the active state gradually 
accumulates information about the neighbor router states and interface states in order to 

15 gradually bring a complete list of ID's of other routers which is included in later Hello 
packets that the route calculation unit sends out. 

It is desirable to provide high network availability by providing improved 
redundancy which can be implemented as a link level protocol running over IP having a 
backup link level process in total real time synchronization with an active one in order to 

20 enable an expeditious switchover when a failure occurs on the active control card. 

Summary of the Invention 

The present invention relates to a method and system for implementing link level 
protocol redundancy in a router. In particular, the invention relates to providing 

25 redundancy of the Open Shortest Path First (OSPF) routing protocol. An active processor 
provides OSPF operations. In the present invention, a standby processor is coupled to the 
active processor. During an initial synchronization, all network link protocol information 
from the active processor is forwarded to the standby processor. The network link 
information can include OSPF state information, OSPF configuration information, OSPF 

30 adjacencies information, OSPF interface information and OSPF global protocol 
information. Thereafter, any updates of network link protocol information are 
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immediately forwarded to the standby processor in an orderly and controlled manner. 
Upon failure of the active processor, the router is switched to the standby processor and 
all OSPF protocol operations are performed on the standby processor. In the present 
invention, all states of the link protocol immediately function as if a failure had not 

5 occurred. Neighbor routers will not notice any difference after switch-over, and no 
additional information is needed from neighbor routers after the switch-over. 
Accordingly, the router's forwarding capability will remain unaffected and a neighbor 
router will not notice that a system failure has occurred. 

In an embodiment of the present invention, a hidden OSPF interface is determined 

10 at the active processor and the standby processor for each area of the router during the 
initial synchronization. The hidden interface is considered a point-to-point unnumbered 
interface which is not exposed to the outside world. A link-state database of the active 
processor is synchronized with the standby processor using the hidden OSPF interface. 
Link-protocol information is also forwarded from the active processor to the standby 

15 processor over the hidden OSPF interface. Upon synchronization of the standby 

processor with the active processor, the hidden OSPF interface for each area is removed. 

In the present invention the active and standby OSPF processors stay in a highly 
synchronized state, referred to as a hot-standby state. Accordingly, an expeditious 
switchover to the standby processor occurs when the active processor fails. 

20 

The invention will be more fully described by reference to the following 
drawings. 

Brief Description of the Drawings 
25 Fig. 1 is a schematic diagram of a system for implementing OSPF redundancy. 

Fig. 2 is a schematic diagram of a redundancy software implementation. 

Fig. 3 is a schematic diagram of an implementation of a hidden interface for each 
OSPF area. 

Fig. 4 is a schematic diagram of states of an OSPF process running on the active 
30 OSPF control card. 
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Fig. 5 is a flow diagram of steps for transfer of network link state information 
from an active process to a standby process. 

Detailed Description 

5 Reference will now be made in greater detail to a preferred embodiment of the 

invention, an example of which is illustrated in the accompanying drawings. Wherever 
possible, the same reference numerals will be used throughout the drawings and the 
description to refer to the same or like parts. 

Fig. 1 is a schematic diagram of a system for implementing link protocol 
10 redundancy in a router 10 in accordance with the teachings of the present invention. 

Router 11 includes active OSPF control card 12. Active OSPF control card 12 performs 
OSPF operations. OSPF operations include mechanisms for building maintaining and 
S verifying one or more adjacencies 14 to one or more neighbor routers 15, exchanging 

8 network information with neighbors and updating best network routes to a local routing 

g 15 table. When a link-state database of two neighboring routers is synchronized, the routers 

2 are referred to as adjacent. Adjacencies control distribution of routing-protocol packets 
P which are sent and received only at adjacencies. 

3 Standby OSPF control card 1 8 is removably coupled to router 1 1 . In the absence 
of standby OSPF control card 18, active OSPF control card 12 operates in a non- 

IJ 20 redundant mode. Active OSPF control card 12 communicates network link protocol 
T information 15 over communication channel 16 to standby OSPF control card 18. 

Preferably, communication channel 16 is a fast and reliable communication channel. For 
example, communication channel 16 can be a duplex Ethernet. Network link protocol 
information 15 can be forwarded in the form of Inter Process Control (IPC) messages. 
25 The same redundancy software for OSPF operations 19 runs on both active OSPF control 
card 12 and standby OSPF control card 18. Redundancy software for OSPF operations 
19 controls updating of network link protocol information 15 between active OSPF 
control card 12 and standby OSPF control card 18 and distinguishes between an active 
mode and a backup mode using system state information, as described in more detail 
30 below. 
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One embodiment of the present invention utilizes OSPF protocols running on the 
Amber Network ASR2000 router (or, alternatively, the ASR2020). The Amber Network 
ASR2000 and ASR2020 technical manuals are incorporated herein by reference as if 
fully set out. Active OSPF control card 12 and standby OSPF control card 18 are 
5 processors which are coupled to a line card and ASIC driver of router 1 1 . It will be 
appreciated that although system 10 is described in terms of the OSPF protocol the 
teachings of the present invention can be used with other conventional link protocols. 

After standby OSPF control card 18 is coupled to router 1 1, an initial 
synchronization is performed as a bulk update of network link information 15 from 
10 running active OSPF control card 12 to standby OSPF control card 18 using redundancy 
software for OSPF operations 19. Network link information 15 can include 
configuration, state and learned information. 
f| After the initial synchronization, ospf active and standby processes become fully 

^ redundant, an OSPF process running in the redundancy software for OSPF operations 19 

p 15 operates in an incremental updating mode. Updates can be posted to active OSPF control 

card 12. All updates are forwarded to standby OSPF control card 18. Standby OSPF 
l f control card 18 receives all OSPF messages and updates in order to maintain total real 

3 time synchronization between active OSPF control card 12 and standby OSPF control 
^ card 18. Accordingly, standby OSPF control card 18 mirrors active OSPF control card 

U 20 12 for implementing redundancy. In this state, referred to as hot-standby, active OSPF 
X control card 12 and standby OSPF control card 18 maintain a substantially synchronous 

state. Thereafter, if a failure of active OSPF control card 12 occurs, standby OSPF 
control card 1 8 will become active and be capable of immediately taking over all 
operations which were previously performed by active OSPF control card 12. 
25 Fig. 2 illustrates a detailed schematic diagram of redundancy software for OSPF 

operations 19 of active OSPF control card 12 and standby OSPF control card 18. 
Redundant card manager (RCM) 20 is a task that maintains a synchronization state 
machine for each task. All tasks of redundancy software for OSPF operations 19 of 
active OSPF control card 12 interact with RCM 20 to send network link information 15 
30 to standby OPF control card 18. OSPF task 21 is a task for determining a status of OSPF 
processes running on active OSPF control card 12. Software redundancy manager 22 is a 
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module that interacts with RCM 20 for determining switching over from an active state in 
which active OSPF control card 12 performs OSPF operation to a standby state in which 
standby OSPF control card 18 takes over OSPF operations. 

During an initial synchronization, redundant card manager (RCM) 20 on standby 
5 OSPF control card 18 contacts OSPF task 21 on active OSPF control card 12 for 

retrieving task information. OSPF task 21 on active OSPF control card 12 automatically 
processes OSPF messages and calculates routes stored in routing table manager (RTM) 
34. Active OSPF control card 12 marks corresponding internal states and transfers link- 
state database information 23 , OSPF state information 24 and OSPF configuration 
10 information 25, OSPF adjacencies information 26, OSPF interface information 27 and 
OSPF global protocol information 28 to backup OSPF control card 18 through RCM 20. 

During the initial synchronization, locks can be used with active OSPF processes 
running on active OSPF control card 12. For example, on active OSPF control card 12, a 
Q lock can be maintained on creating an OSPF adjacency such that a new OSPF adjacency 

g 15 is not established during the initial synchronization. 

h 

% Hidden OSPF interface 30 is created on both active OSPF control card 12 and 

P standby OSPF control card 18 for each area during initial synchronization. An area refers 

1 to a group of contiguous networks and attached hosts. Hidden OSPF interface 30 is a 

^ point-to-point unnumbered interface which is used with system 10 and is not exposed to 

II 20 the outside world. Hidden OSPF adjacency 32 is built automatically over hidden OSPF 

2 interface 30 due to OSPF neighbor discovery. Database 33 is synchronized through 
hidden OSPF adjacency 32. Accordingly, there is one hidden OSPF adjacency 32 
between active OSPF control card 12 and standby OSPF control card 18 for each area. 
Accordingly, hidden OSPF adjacencies 32 can be used to synchronize link state database 

25 information 23 stored in database 33. 

Fig. 3 illustrates an implementation of hidden OSPF interfaces. Router 1 1 has 
two interfaces, interface 14a belongs to area 0 connecting to router 15a, and interface 14b 
belongs to area 2 connecting to Router 15b. In router 11, two hidden OSPF interfaces are 
created for area 0 and area 2, hidden interface 30a is created for area 0, and hidden 

30 interface 30b is created for area 2. Hidden OSPF adjacency 32a runs over hidden OSPF 
interface 30a, and hidden OSPF adjacency 32b runs over hidden OSPF interface 30b. 
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External link state advertisements (LSAs) are synchronized through hidden interface 30a 
for area 0 only. 

Referring to Fig. 2, active OSPF control card 12 and standby OSPF control card 
18 processes OSPF packets and calculates the shortest path first which decides the 

5 shortest path from a router to a destination network by considering cost. Active OSPF 
control card 12 can send OSPF packets to the line card for transmission to neighbor 
routers. Standby OSPF control card 18 does not send any OSPF packets to the line card 
for transmission to neighbor routers. Active OSPF control card 12 and standby OSPF 
control card 18 route updates to routing table manager (RTM) 34, as shown in Fig. 2. 

10 RTM 34 of standby OSPF control card 18 can update redistribution routes to active OSPF 
control card 12. IP interface manager 35 interfaces system 10 to the Internet Protocol 
(IP). Command Line Interface (CLI) commands are used to provide the OSPF 
configuration using datastore 36. Datastore 36 is a task that is responsible for providing 
storage in memory 38. For example, memory 38 can be a compact flash disc. 

15 Accordingly, all information obtained by standby OSPF control card 18 is directly 

obtained from either active OSPF control card 12, IP interface manager 35 or datastore 
36. 

An active state is associated with active OSPF control card 12. A standby state is 
associated with standby OSPF control card 18. A switchover from active OSPF control 
20 card 12 to standby OSPF control card 18 can clear upon failure of active OSPF control 
card 12. When a switchover occurs, standby OSPF control card 18 changes its state to 
active and takes over all OSPF operations. Standby OSPF control card 19 resumes any 
suppressed OSPF actions and begins sending OSPF packets to the line card. 

Fig. 4 is a schematic diagram of states of an active OSPF process 40 running on 
25 active OSPF control card 12. OSPF_FAULT_INIT state 41 is an initial state of active 
OSPF process 40. If system 10 is operating with only active OSPF control card 12 
operating, system 10 remains in OSPF-FAULT INIT state 41 awaiting initiation of a 
standby OSPF control card 18. 

Once standby OSPF control card 18 begins operating, OSPF_FAULT_VERIFY 
30 state 42 is entered in which standby OSPF control card 18 installs OSPF configuration 
information 25 received from data store 36 of active OSPF control card 12 which OSPF 
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configuration has been activated on active OSPF control card 12, as shown in Fig. 2. At 
this time the configuration on active OSPF control card 12 is disabled. OSPF 
configuration on standby OSPF control card 18 from data store 36 is synchronized and 
verified with information of active OSPF process 40. Active OSPF process 40 verifies 
5 whether standby OSPF process 44 running on standby OSPF control card 18 has a totally 
synchronous configuration and system information from data store 36. For example, 
active OSPF control card 12 can verify the interface number and parameters. If the 
verification fails, active OSPF process 40 can retry after a predetermined time interval, 
such as a few seconds. 
10 After verification of the OSPF configuration, active OSPF processes 40 and 

standby OSPF process 44 enter OSPFFAULTSYNC state 45. In 
OSPF_FAULT_SYNC state 45 neighbor information is transferred over communication 

5 link 16 between active OSPF control card 12 and standby OSPF control card 18, as 

6 shown in block 50 of Fig. 5. Neighbor information can be transferred from active OSPF 
g 15 process 40 as an IPC message. A plurality of IPC messages can be used to send a large 

| number of neighbors. Standby OSPF process 44 acknowledges the received IPC message 

P and sends an acknowledged IPC message to active OSPF control card 12, as shown in 

3 block 52. 

During forwarding of neighbor information, active OSPF control card 12 will not 
|| 20 accept any new neighbors by ignoring Hello packets from unknown persons. Once all 
T neighbor information has been transferred from active OSPF control card 12 to standby 

OSPF control card 18, active OSPF control card 12 will forward an end message, as 

shown in block 53. 

Thereafter, standby OSPF process 44 downloads link-state database information 
25 from active OSPF control card 12, in block 54. Link-state database information can be 
synchronized with the use of the internal database synchronization mechanism provided 
by OSPF, as described in RFC 2328 hereby incorporated by reference into this 
application. The database synchronization uses a "Database Exchange Process" in which 
each router describes its database by sending a sequence of Database Description packets 
30 to its neighbor. The two routers enter a master/slave relationship. Each Database 

Description Packet describes a set of LSA's belonging to the router's database. When a 
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neighbor sees an LSA that is more recent than its own database copy, it makes a note that 
the newer LSA should be requested. Each Database Description packet has a sequence 
number. Database Description packets (Polls) sent by the master are acknowledged by 
the slave by echoing the sequence number. Both Polls and responses contain summaries 
5 of link state data. The master is the only one allowed to retransmit Database Description 
Packets which can be done at fixed intervals. When the Database Description Process 
has completed, the databases are deemed synchronized and the routers are marked fully 
adjacent. At this time the adjacency is fully functional and is advertised in the two 
routers-LSA's. Hidden OSPF adjacency 32 is determined between active OSPF control 
10 card 12 and standby OSPF control card 18 for downloading the link-state database 

information 23. Upon receipt of a database requirement message at active OSPF control 
card 12 from standby OSPF control card 18, active OSPF control card 12 is aware that 
J standby OSPF control card 18 is starting to download link-state database information 23. 

1 Downloading of link-state database information continues until a synchronous link-state 
p 15 database exists in active OSPF control card 12 and standby OSPF control card 18. 

J After standby OSPF control card 18 has a synchronous link-state database with 

p active OSPF control card 12, active OSPF control card 12 and standby OSPF control card 

3 18 enter OSPFJ?AULT_FULL state 46. OSPFJFAULTJFULL state 46 is a hot standby 

state in which standby OSPF control card 1 8 can immediately take over all operations of 
y 20 active OSPF control card 12 upon failure. In OSPF FAULT FULL state 46, hidden 

2 OSPF interfaces 30 and hidden adjacencies 32 are removed. Active OSPF process 40 
incrementally updates any changes to standby OSPF process 44 by immediately sending 
updated OSPF state information 24, OSPF configuration information 25, OSPF 
adjacencies information 26, OSPF interface information 27 and OSPF global protocol 

25 information 28 to standby OSPF control card 18 through RCM 20 using IPC messages. 
Any neighbor state or loss of a neighbor adjacency changes to active OSPF control card 
12 are immediately transferred to standby OSPF control card 18 over communication link 
18. Any link-state database change is transferred to backup OSPF control card 18 with 
conventional OSPF synchronization mechanisms over communication link 15. 

30 Configuration changes in the active OSPF control card can be forwarded to 

backup OSPF control card 18 as an IPC message to trigger standby OSPF control card 18 
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to read updated information from data store 36. Alternatively, a configuration command 
can be forwarded from CLI to backup OSPF control module 18. 

If a failure of active OSPF control card 12 occurs when standby OSPF control 
card 18 is in the OSPF_FAULTJFULL state, the standby OSPF control card 18 
immediately takes over all OSPF operations. If a failure of active OSPF control card 12 
occurs when standby OSPF control card 18 is in one of the states of OSPFJFAULTINIT 
state 41, OSPF_FAULT_VERIFY state 12 or OSPFJFAULT_SYNC state 45, it indicates 
that the standby is not in a full redundant state, and the standby card will be reset. 
Because the system has not reached a redundant state, a failure of the active card will 
interrupt the service. 

It is to be understood that the above-described embodiments are illustrative of 
only a few of the many possible specific embodiments which can represent applications 
of the principles of the invention. Numerous and varied other arrangements can be 
readily devised in accordance with these principles by those skilled in the art without 
departing from the spirit and scope of the invention. 
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